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1 Optimizing queries using materialized views: a practical, scalable solution 
Jonathan Goldstein, Per-Ake Larson 
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international conference on Management of data SIGMOD '01, Volume 
30 Issue 2 
Publisher: ACM Press 

Full text available: ^ [pdf(202.08 Additional Information: full citation , abstract , references , citings , 
KB) index terms , review 



Materialized views can provide massive improvements in query processing time, 
especially for aggregation queries over large tables. To realize this potential, the 
query optimizer must know how and when to exploit materialized views. This paper 
presents a fast and scalable algorithm for determining whether part or all of a query 
can be computed from materialized views and describes how it can be incorporated 
in transformation-based optimizers. The current version handles views composed of 
sele ... 

Keywords: materialized views, query optimization, view matching 
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Dieter Baer, Klaus Sun, Leon Treff 

February 1990 ACM SIGAda Ada Letters, Volume X Issue 2 
Publisher: ACM Press 
Full text available: fg| pdf(874.40 



KB) 



Additional Information: full citation , abstract , index terms 



The use of the programming language Ada for the development of commercial 
applications largely depends on the availability of interfaces to database and 
interactive communication systems. This paper introduces an interface from Ada to 
the database language SQL. The approach taken in this paper is different from that 
taken in the interface definition Ada/SQL which is proposed as a binding of the ANSI 
standard database language SQL. This interface is defined as an Ada package with 
nested generic p ... 



Laws and applications: Privacy preserving database application testing 
Xintao Wu, Yongge Wang, Yuliang Zheng 

October 2003 Proceedings of the 2003 ACM workshop on Privacy in the 

electronic society 
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Full text available: fj| | pdf(1 75.62 Additiona | information: full citation , abstract , references , index terms 
KB) 

Traditionally, application software developers carry out their tests on their own local 
development databases. However, such local databases usually have only a small 
number of sample data and hence cannot simulate satisfactorily a live environment, 
especially in terms of performance and scalability testing. On the other hand, the 
idea of testing applications over live production databases is increasingly problematic 
in most situations primarily due to the fact that such use of liv ... 

Keywords: database application testing, indistinguishability, privacy 
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Publisher: ACM Press 
Full text available: *gjpdf(20077 



Additional Information: full citation , abstract , references , index terms 
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Recently, techniques for supporting efficient similarity search over huge transaction 
datasets have emerged as an important research area. Several indexing schemes 
have been proposed towards this direction. Typically, these schemes provide a 
tradeoff between searching efficiency and indexing overhead in terms of space. 

In this paper, we propose a novel indexing scheme for similarity search on 
transaction data. Based on well-studied clustering techniques, we develop a 
construction algor ... 

Keywords: data mining, indexing, similarity search, transaction data 
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Publisher: ACM Press 

Full text available: fglpdf(3.34 MB) Additional Information: full citation , abstract, references , citinas, 
L -' index terms , review 

Maintaining the integrity of databases is one of the promises of database 
management systems. This includes assuring that integrity constraints are invariants 
of database transactions. This is very difficult to accomplish efficiently in the 
presence of complex constraints and large amounts of data. One way to minimize 
the amount of processing required to maintain database integrity over transaction 
processing is to prove at compile-time that transactions cannot, if run atomically, 
disobey i ... 

7 Towards an object-centered database language 
Martin L. Kersten, Frans H. Schippers 

September 1986 Proceedings on the 1986 international workshop on 
Object-oriented database systems 
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Publisher: IEEE Computer Society Press 

Full text available: ^g ] pdf(752.30 Additional Information: full citation , abstract , references , citings , 
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In this report we discuss ongoing research in the area of object-oriented database 
systems at the CWI. The central theme of this paper is the friction encountered when 
using an object-oriented (O-O) language, such as Smalltalk, in the database arena. A 
series of (open) database issues is given for which the object-oriented paradigm 
does not provide an elegant solution. A refinement of the 0-0 concepts is given 
which emphasizes the dynamic classification of objects through its characteristic ... 

8 Selection conditions in main memory { 
Kenneth A. Ross 

March 2004 ACM Transactions on Database Systems (TODS), Volume 29 Issue 1 
Publisher: ACM Press 

Full text available: ^ | pdf(296.54 Additiona | information: full citation , abstract , references , index terms 
KB) 

We consider the fundamental operation of applying a compound filtering condition to 
a set of records. With large main memories available cheaply, systems may choose 
to keep the data entirely in main memory, in order to improve query and/or update 
performance.The design of a data-intensive algorithm in main memory needs to take 
into account the architectural characteristics of modern processors, just as a 
disk-based method needs to consider the physical characteristics of disk devices. An 
importa ... 

Keywords: Branch misprediction 



Model independent assertions for integration of heterogeneous schemas 
Stefano Spaccapietra, Christine Parent, Yann Dupont 

July 1992 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 1 Issue 1 
Publisher: Springer-Verlag New York, Inc. 

Full text available: Qpdf(2.15 MB) Additional Information: full citation , abstract , references , citings 

Due to the proliferation of database applications, the integration of existing 
databases into a distributed or federated system is one of the major challenges in 
responding to enterprises' information requirements. Some proposed integration 
techniques aim at providing database administrators (DBAs) with a view definition 
language they can use to build the desired integrated schema. These techniques 
leave to the DBA the responsibility of appropriately restructuring schema elements 
from existing I ... 

Keywords: conceptual modeling, database design and integration, distributed 
databases, federated databases, heterogeneous databases, schema integration 



10 Semantic interfaces and OWL tools: Semantic email 
Luke McDowell, Oren Etzioni, Alon Halevy, Henry Levy 

May 2004 Proceedings of the 13th international conference on World Wide Web 
Publisher: ACM Press 

Full text available: ^ | pdf(508.79 Additjona , information: full citation , abstract , references , index terms 
KB) 

This paper investigates how the vision of the Semantic Web can be carried overto 
the realm of email. We Introduce a general notion of semantice mail, in which an 
email message consists of an RDF query or update coupled with corresponding 
explanatory text. Semantic email opens the door to a wide range of automated, 
email-mediated applications with formally guaranteed properties. In particular, this 
paper introduces a broad class of semantic email processes. For example consider 
the process ... 

Keywords: decision-theoretic, formal model, satisfiability, semantic web 
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11 The design of Star's records processing: data processing for the noncomputer Q 
professional 

Robert Purvy, Jerry Farrell, Paul Klose 
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Publisher: ACM Press 
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12 Research papers: data cleaning and mapping: Supporting executable mappings 
in model management 

Sergey Melnik, Philip A. Bernstein, Alon Halevy, Erhard Rahm 
June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: f j| pdf(4Q8.49 Addjtjona| , nforrnalion: f^ajion, abstract , references 
KB) 

Model management is an approach to simplify the programming of 
metadata-intensive applications. It offers developers powerful operators, such as 
Compose, Diff, and Merge, that are applied to models, such as database schemas or 
interface specifications, and to mappings between models. Prior model management 
solutions focused on a simple class of mappings that do not have executable 
semantics. Yet many metadata applications require that mappings be executable, 
expressed in SQL, XSLT, or other data ... 

13 A module for improving data access and management in an integrated CAD 
environment 

G. P. Barabino, G. S. Barabino, G. Bisio, M. Marchesi 
June 1985 Proceedings of the 22nd ACM/IEEE conference on Design automation 
Publisher: ACM Press 

Full text available: | ||pdf(722.13 Additjona | information: full citation , abstract , references , index terms 
KB) 

A modular system is presented for handling design data, centered around the 
relational DBMS Ingres, taking advantage of a very careful database schema 
generation and of the use of software modules to comply with the specific 
requirements of design data. This work is mainly concerned with one of these 
modules, called LIPS, which enables the system to handle design data local to an 
application program with a high level query language, interactive level performances 
and the ability ... 

14 Special issue: Al in engineering 
D. Sriram, R. Joobbani 
April 1985 ACM SIGART Bulletin, Issue 92 
Publisher: ACM Press 

Full text available: pdf(8.79 MB) Additional Information: full citation , abstract 

The papers in this special issue were compiled from responses to the announcement 
in the July 1984 issue of the SIGART newsletter and notices posted over the 
ARPAnet. The interest being shown in this area is reflected in the sixty papers 
received from over six countries. About half the papers were received over the 
computer network. 

15 Constructing information systems based on schema reuse 
Wen-Syan Li, Richard D. Holowczak 

November 1996 Proceedings of the fifth international conference on Information 
and knowledge management 

Publisher: ACM Press 

Full text available: | g| pdf(945.90 Additiona , information- full citation , references , index terms 
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20 Fast computation of low rank matrix approximations 
Dimitris Achlioptas, Frank McSherry 

July 2001 Proceedings of the thirty-third annual ACM symposium on Theory of 

computing 
Publisher: ACM Press 

Full text available: ^g j pdf(223.29 Additional Information: full citation , abstract , references , citings , 
KB) index terms 

Given a matrix A it is often desirable to find an approximation to A that has low rank. 
We introduce a simple technique for accelerating the computation of such 
approximations when A has strong spectral structure, i.e., when the singular values 
of interest are significantly greater than those of a random matrix with size and 
entries similar to A. Our technique amounts to independently sampling and/or 
quantizing the entries ... 
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16 Research sessions: query uncertainty: Efficient set joins on similarity predicates .Q 
^ Sunita Sarawagi, Alok Kirpal 

^7 June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: ^ pdf(265.08 Addjtiona| | nformation: f uN citation , abstract , references 
KB) 

In this paper we present an efficient, scalable and general algorithm for performing 
set joins on predicates involving various similarity measures like intersect size, 
Jaccard-coefficient, cosine similarity, and edit-distance. This expands the existing 
suite of algorithms for set joins on simpler predicates such as, set containment, 
equality and non-zero overlap. We start with a basic inverted index based probing 
method and add a sequence of optimizations that result in one to two orders of 
magn ... 

17 STHoles: a multidimensional workload-aware histogram Q 
Nicolas Bruno, Surajit Chaudhuri, Luis Gravano 

May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD 
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Attributes of a relation are not typically independent. Multidimensional histograms 
can be an effective tool for accurate multiattribute query selectivity estimation. In 
this paper, we introduce STHoles, a "workload-aware" histogram that allows bucket 
nesting to capture data regions with reasonably uniform tuple density. STHoles 
histograms are built without examining the data sets, but rather by just analyzing 
query results. Buckets are allocated where needed the mos ... 
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In this paper we describe the design and implementation of a static array-bound 
checker for a family of embedded programs: the flight control software of recent 
Mars missions. These codes are large (up to 280 KLOC), pointer intensive, heavily 
multithreaded and written in an object-oriented style, which makes their analysis 
very challenging. We designed a tool called C Global Surveyor (CGS) that can 
analyze the largest code in a couple of hours with a precision of 80%. The scalability 
and precisi ... 

Keywords: abstract interpretation, array-bound checking, difference-bound 
matrices, pointer analysis, program verification 
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